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Abstract 

Edge bundling reduces the visual clutter in a drawing of a graph by uniting the edges into bundles. 
We propose a method of edge bundling drawing each edge of a bundle separately as in metro-maps 
and call our method ordered bundles. To produce aesthetically looking edge routes it minimizes a cost 
function on the edges. The cost function depends on the ink, required to draw the edges, the edge 
lengths, widths and separations. The cost also penalizes for too many edges passing through narrow 
channels by using the constrained Delaunay triangulation. The method avoids unnecessary edge-node 
and edge-edge crossings. To draw edges with the minimal number of crossings and separately within the 
same bundle we develop an efficient algorithm solving a variant of the metro-line crossing minimization 
problem. In general, the method creates clear and smooth edge routes giving an overview of the global 
graph structure, while still drawing each edge separately and thus enabling local analysis. 

1 Introduction 

Graph drawing is an important visualization tool for exploring and analyzing network data. The core 
components of most graph drawing algorithms are computation of positions of the nodes and edge routing. 
In this paper we concentrate on the latter problem. 

For many real-world graphs with substantial numbers of edges traditional algorithms produce visually 
cluttered layouts. The relations between the nodes are difficult to analyze by looking at such layouts. 
Recently, edge bundling techniques have been developed in which some edge segments running close to each 
other are collapsed into bundles to reduce the clutter. While these methods create an overview drawing, 
they typically allow the edges within a bundle to cross and overlap each other arbitrarily, making individual 
edges hard to follow. In addition, previous approaches allow the edges to overlap the nodes and obscure 
their text or graphics. 

We present a novel edge routing algorithm for undirected graphs. The algorithm produces a drawing in 
a "metro line" style (see Fig. [I]). The graphs for which our algorithm is best applicable are of medium size 
with a large number of edges, although it can process larger graphs efficiently too. 

The input for our algorithm is an undirected graph with given node positions. These positions can be 
generated by a graph layout algorithm, or, in some applications (for instance, geographical ones) they are 
fixed in advance. During the algorithm the node positions are not changed. The main steps of our algorithm 
are similar to existing approaches, but with several innovations, which we indicate with italic text in the 
following description. 

Edge routing. The edges are routed along the edges of some underlying graph and the overlapping 
parts are organized into bundles. One approach here has been to minimize the total "ink" of the bundles. 
However, this often produces excessively long paths. For this reason we introduce a novel cost function for 
edge routing. Minimizing this function forces the paths to share routes, creating bundles, but at the same 
time it keeps the paths relatively short. Additionally, the cost function penalizes routing too many edges 
through narrow gaps between the nodes. We furthermore show how to route the bundles around the nodes. 

Path construction. In this step the paths belonging to the same bundle are "nudged" away from each 
other. The effect of this action is that individual edges become visible and the bundles acquire thickness. 
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Figure 1: Edge bundling example (jazz graph) 



Contrary to previous approaches, we try to draw the edges of a bundle as parallel as possible with a given 
gap. However, such a routing might not always exist because of the limited space between the nodes. We 
provide a heuristic that finds a drawing with bundles of a suitable thickness. 

Path ordering. To route individual edges, an order of the edge segments within each bundle needs to 
be computed. This order minimizes the number of crossings between the edges of the same bundle. The 
problem of finding such an order is a variant of the metro-line crossing minimization problem. We provide 
a new efficient algorithm that solves this problem exactly. 

The next section summarizes related work. In Section[3]we introduce the ordered edge bundling algorithm, 
followed by the detailed explanation of the algorithm steps in Sections i][6 Results of experiments are 



presented in Section [7] We discuss some additional issues and future work in Section [H] 



2 Related work 

Edge routing. The problem of drawing edges as curves that avoid node interiors has been considered in 
several graph drawing papers. Dobkin et al. [10 initiated a visibility-based approach in which the shortest 
obstacle-avoiding route for an edge is chosen. The method requires the computation of the visibility graph, 
which can be too slow for graphs with hundreds of nodes. An improvement was introduced by Nachman- 
son et al. [13] . achieving faster edge routing using sparse visibility graphs. The latter method constructs 
approximate shortest paths for the edges. We use the method as the starting point of our algorithm. 

Dwyer et al. |12) proposed a force-directed method, which is able to improve a given edge routing by 
straightening and shortening edges that do not overlap node labels. However, a feasible initial routing is 
needed. Moreover, the method moves the nodes, which is prohibited in our scenario. 

The problem of finding obstacle-avoiding paths in the plane arises in several other contexts. In robotics, 
the motion-planning problem is to find a collision-free path among polygonal obstacles. This problem is 
usually solved with a visibility graph, a Voronoi diagram, or a combination thereof (30) . Though these 
methods can be used to route a single edge in a graph, they do not apply directly to edge bundling. Many 
path routing problems were studied in very-large-scale integration (VLSI) community. We use some ideas 
of [51 IB] as will be explained later. 

Edge bundling. The idea of improving the quality of layouts via edge bundling is related to edge 
concentration [26] . in which a layered graph is covered by bicliques in order to reduce the number of edges. 
Eppstein et al. [9|. 114] proposed a representation of a graph called a confluent drawing. In a confluent drawing 
a non-planar graph is represented by merging crossing edges to one, in the manner of railway tracks. 

We believe that the first discussion of bundled edges in the graph drawing literature appeared in [17) . 
There the authors improve circular layouts by routing edges either on the outer or on the inner face of a 
circle. In that paper the edges are bundled by an algorithm that tries to minimize the total ink of the 
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drawing. Here we follow a similar strategy, but in addition we try to keep the edges themselves short (among 
other criteria). 

In the hierarchical approach of [21] the edges are bundled together based on an additional tree structure. 
Unfortunately, not every graph comes with a suitable underlying tree, so the method does not extend directly 
to general layouts. In |29) edge bundles were computed for layered graphs. In contrast, our method applies 
to general graph layouts. 

Edge bundling methods for general graphs are given in [7J [TBI [55J [53] . In the force-directed approach [55] 
the edges can attract each other to organize themselves into bundles. The method is not efficient, and while 
it produces visually appealing drawings, they are often ambiguous in a sense that it is hard to follow an 
individual edge. The approaches [7[ [53] have a common feature: they both create a grid graph for edge 
routing. Our method also uses a special graph for edge routing, but our approach is different; we modify 
this graph to obtain better edge bundles. And unlike [3 [151 [22], we avoid edge- node overlaps. 

Recently there were several attempts to enhance edge bundled graph visualizations via image-based 
rendering techniques. Colors and opacity were used to highlight the density of overlapping lines [7j 121] , 
Shading, luminance, and saturation encode edge types and edge similarity |15j . We focus on the geometry 
of the edge bundles only. The existing rendering techniques may be applied on top of our routing scheme. 

Path ordering. The problem of ordering paths along the edges of an embedded graph is known as 
metro-line crossing minimization (MLCM) problem. The problem was introduced by Benkert et al. 0, and 
was studied in several variants in [2j [3J [JJ [27]. We mention two variants of the MLCM problem that are 
related to our path ordering problem. 

Asquith et al. [5] defined the so-called MLCM-FixedSE problem, in which it is specified whether a path 
terminates at the top or bottom of its terminal node. For a graph G — (V, E) and a set of paths P they give 
the algorithm with 0(|i?| 5 / 2 |P| 3 ) time complexity. A closely related problem called MLCM-T1, in which all 
paths connect degree-1 terminal nodes in G, was considered by Argyriou et al. [JJ. Their algorithm computes 
an optimal ordering of paths and has a running time ofO((|-E| + |P| 2 )|P|). Recently, Nollenburg [57] presented 
an 0(|U| |P| 2 )-time algorithm for these variants of the MLCM problem. Our algorithm can be used to solve 
both the MLCM-FixedSE and MLCM-T1 problems with complexity 0(|V||P| + |V"| log ]V"| + \E\). To the 
best of our knowledge, this is the fastest solution to date of these variants of MLCM. 

Crossing minimization problems have also been researched in the circuit design community [191 1241 [5] . 
Given a global routing of nets (physical wires) among the circuit modules, the goal is to adjust the routing 
so as to minimize crossings between the pairs of the nets. In this context, a problem similar to the path 
ordering problem was studied by Groeneveld [Tj5], who suggested 0(|V| 2 |P|) algorithm. Another method, 
which works for graphs of maximum degree 4, is given in [5]. Marek-Sadowska et al. [53] considered the 
problem of distributing the set of crossings among circuit regions. They introduced constraints for the 
number of allowed crossings in each region, and provided an algorithm to build an ordering with these 
constraints. Their algorithm has 0(|F| 2 |P| + |U|£ 3 / 2 ) complexity, where £ is the number of path crossings. 

3 Algorithm 

In this section we introduce some terminology and give an overview of our algorithm. The input to the edge 
bundling algorithm comprises an undirected graph G = (V,E), where V is the set of nodes and E is the 
set of edges. Each node of the graph comes with its boundary curve, which is a simple closed curve. The 
boundary curves of different nodes are disjoint, and the interiors they surround are disjoint too. The output 
of the algorithm is a set of paths P = {n st : st e E}, where for an edge st G E its path is referenced as ir st ; 
it is a simple curve in the plane connecting the boundary curve of s with the boundary curve of t. The input 
also includes a required path width for each edge, and a desired path separation. 

Step 1 of the edge bundling algorithm produces an initial routing of the paths. This routing is done on 
an auxiliary graph G called the routing graph. The edges of G are straight-line segments on the plane. 

Step 2 improves the routing. Firstly, G is cleaned up by removal of all its nodes and edges that are not 
part of any path. Secondly, G is modified to accommodate the paths together with the path widths and 
separations, and to improve the paths. 

In steps 1 and 2 we use a notion of ink similar to [T7J [55] . Given a graph embedded in the plane with 
straight-line edges, the ink I of a set of paths in the graph is the sum of the lengths of the edges of the 
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Figure 2: Algorithm pipeline. 



graph used in these paths; if an edge is used by several paths its length is counted only once. A set of paths 
sharing an edge of the graph is called a bundle. It is important to note that in the final drawing the paths 
in a bundle will be drawn as separate curves, so I does not represent the amount of ink used in the final 
drawing. 

Step 3. Consider a bundle with the shared edge uv in G. We would like to draw the segments of the 
paths that run along uv as distinct curves parallel to uv. This requires ordering of the paths of each bundle. 
Step 3 produces such an order for all bundles in a way that minimizes crossings (which occur at the nodes) . 

Step 4 draws each path as a smooth curve using straight-line, cubic Bczier, or arc segments. 

The overall algorithm is illustrated in Fig. [2j and discussed in detail in the next sections. 



4 Edge Routing 

4.1 Generation of the Routing Graph 

As a preprocessing step, for each node s £ V we create a convex polygon O s containing the boundary curve 
of s in its interior. We keep the number of corners in O s bounded from above by some constant. We call 
O s an obstacle, and build the obstacles in such a way that O s and Ot are disjoint for different s and t. For 
each s € V we choose a point inside of the boundary curve of s, which of course is inside of O s too, and call 



this point a center of O s , see Fig. 3(a) 



The algorithm starts with the construction of the routing graph G. We follow the approach of [13] 
building a sparse visibility graph. The vertices of G comprise the obstacle centers, the obstacle comers, 
and some additional vertices from the boundaries of O s . These additional point are added to make G rich 
enough for path generation. The points added to O s have the property that each cone with the apex at the 
center of O s and with an angle given in ad vance, 7r/6 in our settings, contains at least one corner of O s , or 



at least one new added point, see Fig 4(a) To build the visibility edges of G adjacent to vertex u, the vertex 
is surrounded by a family of cones covering the plane with the apexes at u, and in each cone at most one 
edge is created by connecting u with another vertex of G which is visible from u, belongs to the cone, and 
is closest to u in the cone. The angles of the cones are tt/6 in our default settings. 

The method works in 0(r log r) time and creates G with 0(r) edges, where r the number of vertices in 

G. 

There are two types of edges in G: the visibility edges, and the edges from the obstacle centers. A 
visibility edge has both its ends belonging to the obstacle boundaries and it does not intersect the interior of 
any obstacle. An edge of the second type is adjacent to an obstacle center. Such an edge does not intersect 



the interior of any obstacle except of the one containing the center, see Fig. 4(a) We denote the set of nodes 
of G by W 7 and the set of edges of G by U. By construction r = 0(\V\), and therefore G has 0(| V|) nodes 
and 0(|V|) edges. 
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Figure 4: A fragment of G is shown, (a) The boundary curves are rendered with dotted curves, the obstacles 
with dashed curves, and the edges of G with solid curves, (b) The obstacles are shrunk, so the visibility 
edges of G do not cross or touch the obstacles. 



Further on, it is necessary for our method to keep the visibility edges of G disjoint from the obstacles. 
For this purpose after computing G we shrink the obstacles slightly, so that each obstacle still contains its 
boundary curve, but the visibility edges do not touch or intersect the obstacles anymore, see Fig. |4(b)| 

Next we route the paths on G. 



4.2 Routing Criteria and Routing Cost 

To create bundles we apply the idea of ink minimization [17, 251ll6|. However, minimizing the ink alone does 
not always produce satisfactory results; some paths in a routing with a small ink might have sharp angles 
or long detours as shown in Fig. |3(b)| We improve the paths by introducing additional criteria. 

• Short Path Lengths. The length of a path ir st is denoted by £ st . One approach would be to minimize 
the sum of t s %. However, this gives insufficient weight to paths that connect relatively close pairs of 
nodes. For this reason we instead consider the normalized lengths £ st /\st\, where \st\ is the distance 
between the centers of O s and Ot- 

• Capacity. We would like to avoid routing too many paths through a narrow gap. For this purpose 
we adapt an idea of [H[S], where capacities are associated with segments connecting obstacles. To 
construct these segments, we compute the constrained Delaunay triangulation (CDT) [IT] with the 
constrained edges being the obstacle sides. A Delaunay edge connecting two different obstacles is 
called a capacity segment. We penalize for too many paths passing through a capacity segment. 

Let us define capacity c a of a capacity segment a. Suppose a connects points a G A and b 6 B for 
obstacles A and B. Then c„ = (\a,B\ + \b,A\)/2, where \a,B\ is the distance from a to B, and \b,A\ 
is the distance from b to A. 

During routing a path we assign it to the capacity segments intersected by the path, see Fig. [5j For 
each capacity segment a we select all paths assigned to a, and calculate routing width w a as the 
total path widths plus the total path separation. Capacity overflow penalty of a is defined as 
Per = (w a — c CT ) + . The total capacity overflow penalty is defined as C = J^aPv where the sum is 
taken over all capacity segments. 
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Figure 5: The solid curve represents a path, the long dashed opaque line segments represent the capacity 
segments crossed by the path, short dashed line segments represent the shrunk obstacle boundaries. 

Minimizing the ink, the paths lengths, and the capacity overflow penalty often is impossible at the same 
time. For example, smaller ink usually leads to longer paths. The capacity penalty can force some overlapping 
paths to go through different channels thus possibly increasing the ink. To find a solution satisfying all the 
criteria we consider a multi-objective optimization problem: minimize the routing cost of a set of paths P 
defined as 



We stress here that the routing cost is used only in steps 1 and 2 of our algorithm. Steps 3 and 4 
producing the final routing preserve the overflow penalty and try to preserve the path lengths. However the 
notion of ink, as we defined it, is not applicable to the final routing. 

4.3 Routing of Paths 

We try to route the paths on G with the minimal routing cost. The problem is formulated as follows. 

Problem 1 (Path Routing) Given the graph G — (W,U) and a set of pairs of nodes (sjtj) £ W 2 , find 
paths between Sj and ti for all i so that routing cost is minimized. 

Here Si and ti correspond to the centers of obstacles connected by the edges of E. Problem [I] is NP- 
hard, because its instance with k in ^ = 1, fc; en = k cap = is a Steiner Forest Problem |18j . Therefore we 
suggest a heuristic to optimize routing cost. In this heuristic we solve an easier task, where some paths are 
already known and we need to route the next path. We will route it by minimizing an additional cost, which 
is the increment of the routing cost associated with this path. For a path n st we have additional cost = 
kink A/ + ki en £ st /\st\ + k cap AC. Here AJ is the increment in the ink, which is the sum of edge lengths of 
it st that were not part of any previous path; AC is the growth of X^Pcm where the sum is taken over all 
capacity segments a assigned to path 7r S (. 

Problem 2 (Single Path Routing) Given the graph G = (W, U) and a set P of already routed paths on 
G, find a path from s to t so that the additional cost is minimized. 

To solve the problem, for each edge e of G we assign weight k ink 5 e + k len t e /\st\ + k cap AC e to e . Here 
S e is t e if e is not taken by a previous path, and otherwise. The value of AC e is the growth of ^2 a Pa- for 
the case the path is passing through e, where the sum is taken over all capacity segments a crossed by e. 

It can be seen that the minimum of additional cost is achieved by a shortest path from s to t when we 
use these weights. We apply the Dijkstra's algorithm to find a path solving the problem. 

To find a solution for Problem]!] we organize paths in a sequence (sjii), . . . , (s m t m ), and iteratively solve 
Problem [2] for (s^tk), with the paths (s,ij), i < k already routed, for fc = 1, ...,m. 

The routing of a single path takes 0(|J7| log|W|) time with the Dijkstra's algorithm. Routing all paths 
amounts to 0(\E\ \U\ log \W\) steps. 

Problem 3 (Multiple Path Routing) Given the graph G with some paths already routed, find paths for 
(s*ii), . . . , (s*tk) so that additional cost is minimized. 




6 



t, 



t 2 



(a) 





Figure 6: An example for Problem [3] with ki n k = 2, ki en — 1, k cap — 0, and \s*ti\ — | s* i 2 1 = 1- The goal is to 
find paths s*£i and s*t2 with minimal additional cost, (a) Graph G, weights l e for each edge e are shown, 
(b) Constructed graph H with edge weights, (c) Optimal paths (blue) with additional cost = 17. 

We can solve this problem by a dynamic programming approach. We first fix a set of pre-existing paths 
in G; additional cost will always be with respect to these paths. Let us call a state a pair (v,P), where v is 
a node of G, and P is a subset of {ti, . . . , t^}- We need to solve our problem for the state (s* , {ti, . . . , £&}). 
We reduce the problem to solving it for "smaller" states, that are the states with fewer elements in P. For 
a state (v, P) we define its cost f(v, P) as the minimal additional cost of a set of paths {(v, t), t E P}. A set 
of paths giving the minimal f(v , P) is called an optimal set for state (v, P). Let us clarify the structure of 
an optimal set of paths. 

By the subgraph generated by a set of paths in G we mean the subgraph of G comprising all edges and 
nodes in the paths. 

Lemma 1 For each state there exists an optimal set of paths that generates a tree. 

Proof: Let II be any optimal set of paths for state (v,P), and G n be the graph generated by II, and note 
that it is connected. Let T be a shortest path tree of G n , rooted at v, with respect to ordinary edge lengths. 
Let II' be the set of paths connecting v to the points of P in T. The additional cost of IT' is at most that of 
IT. Indeed, the increment in / is no greater because T is a subgraph of G n . Each path of II' is shortest in 
G n and thus no longer than the corresponding path of II. Hence, II' is an optimal set for (v,P). m 



Lemma [T] leads us to the following formula 
f(v,P)=m 



f(u, P) + ki„k S V u + ki en J2teP ^vu/\s*t\, for u e W adjacent to v, 
f(v, P') + f(v, P-P'), for P' with OcF'cP 



The minimum is taken over both expressions on the right as u and P' vary. To verify this, we consider 
some optimal set of paths for (v, P) that form a tree, and split into two cases. The first line corresponds to 
the case where u is the only neighbor of v in the tree. The second line is the case where v has at least two 
neighbors, thus the paths can be partitioned into two proper subsets with no common edges. 

Now we describe how to compute f(v, P). Let us assume that / is known for all states (u, P') for P' a 
proper subset of P. To compute f(v, P), a new graph H is constructed with G as a subgraph. An edge e of 
G has weight ki n h S e -(- k[ en ^^^p^e/| s *^| in H. We add a new node h to H and connect it with all nodes 
of G. For every new edge hu we assign weight minp/ f(u, P') + f(u, P — P'), where P' varies over proper 
non-empty subsets of P (see Fig. [6]). One can see that the required value f(v, P) is the length of a shortest 
path from v to h in graph H. We can compute it using Dijkstra's algorithm. 

To solve Problemk^we work bottom-up. We first compute all f(v, P) with |P| = 1 and v is a node of G, 
by the algorithm for Problem [2j where we find a path with the minimal additional cost. Then we compute 
the values f(v, P) for each v and \P\ = 2, . . . , k by creating the corresponding graphs H . Finally, the answer 
for the problem is f(s*, {ti, . . . 
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(a) (b) (c) (d) 

Figure 7: (a) [ab] and [cd] are the bundle bases of edge uu' . The circles represent the hubs of the intermediate 
nodes, and the rounded rectangles represent the hubs of the obstacle centers. Segments 7r ul ,, the hub segment, 
and w vw for the dashed path, (b) An example of bundle bases when not enough free space is found, (c) An 
example of a biarc consisting of two arcs a and b. (d) The hub radius should be large enough that a < § 
and /3 > path separation. 



Running time The main steps of the above algorithm are the construction of graph H, and finding a 
shortest path on it with the Dijkstra's algorithm for each state (v, P). Luckily, graph H depends only on the 
P component of a state. The construction of graph H for a fixed set P takes 0(2l p l|W|) time. We execute 
the Dijkstra's algorithm only once per P starting from h to compute f(v,P) for all v £ W. Thus, finding 
f(v, P) for a known H and for all v £ W takes C>(|VF| log \ W\) time. Summing over all possible sets P gives 

o(j2[ 2lPl \ w \ +\W\log\W\f) =0(3 k \W\ + 2 k \W\log\W\). 



5 Path Construction 

5.1 Structure of the paths 

Before proceeding to the optimization of G, we need to give more details on the structure of the paths that 
we produce and define some additional constructions on G. 

We call a node v £ W an intermediate node if it is not an obstacle center. Recall that a bundle is 
a set of paths sharing the same edge of U. That means that as soon as the paths are routed on G, the 
bundles become defined. In the final drawing we would like to draw the paths of a bundle respecting the 



width and path separation and outside of the obstacles. As described in Section 4.1 we shrink the obstacles 
after constructing G, therefore each intermediate node of W lies outside the obstacles; an obstacle intersects 
an edge of U only if the edge is adjacent to the obstacle center. To give a general idea of the final path 



structure, we refer Fig. 7(a) 



We surround each node of G by a simple closed curve. In the case of an obstacle center it is the boundary 
curve of the node, and in the case of an intermediate node it is a circle disjoint from the obstacles. We call 
these curves hubs, and denote the hub of node v by h v . For each node u £ W and each edge uv £ U we 
construct a line segment with both its ends belonging to h v . We call these segments the bundle bases of 
edge uv. When only one path passes through an edge its bundle bases collapse to points. 

Each path tt from the bundle of edge uv is represented by a straight line segment connecting the bundle 
bases corresponding to uv. We denote such a segment tt uv and refer it as a bundle segment. The bundle 
segments of different paths are drawn in a particular order, as discussed in Section [6] below. If path tt passes 
through consecutive nodes u,v,w, then we connect the segments tt uv and tt vw by a smooth curve called a 
hub segment. We construct the hub segment in a way that it is tangent to tt uv and n vw , and is contained in 
h v . For the hub segments we use biarcs, following [28 . A biarc is a smooth curve formed by two circular arcs 
with a common point, see Fig. |7(c) Therefore, in the final drawing each path is represented as a sequence 



of straight line bundle segments and biarc hub segments. 

Next we give a heuristic that moves intermediate nodes of G to find space for hubs and bundles. Ideally, 
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we would like to keep the bundle bases large enough to accommodate all the bundle segments with their 
widths and separations. However, this will not always succeed, as in Fig. |7(b)| 



5.2 Optimization Goals 

We change positions of intermediate nodes of G to provide enough space for drawing paths. For this we 
compute how much free space we need around bundles and hubs. The ideal bundle width for edge e, denoted 
by e w , is the sum of all path widths of the bundle, together with the total of the separations between them. 
For example, for a bundle with three paths of widths 1, 2 and 2, and a path separation of 1, the bundle 
width is 7. 

The radius of a hub for an intermediate node u is the minimum of two radii: a desired one, and an 
allowed one. The desired radius is at least fj,e w , where e S U is an adjacent to u edge, and /i is a positive 



constant (see Fig. 7(d) I. In our implementation, /i = In addition, the desired radius is chosen large 
enough that any two bundles are separated before entering the hub by at least the given edge separation. 
We also bound the desired radius from above by some predefined constant, to avoid huge hubs. The allowed 
radius should be small enough that: (a) the hub interior does not intersect other hubs, and (b) the hub is 
disjoint from the obstacles. 

We say that the hub of an intermediate node u is valid if conditions (a) and (b) above are satisfied, and 
in addition for each edge uv the straight line segment from the center of h u to the center of h v does not 
intersect any obstacle (if v is an intermediate node) or intersects only obstacle O v (otherwise). The routing 
graph is valid if all intermediate hubs are valid. 

Our optimization procedure finds a valid graph G while pursuing two objectives. First, the^radii of all 
hubs should be as close to the desired radii as possible. Second, we try to keep routing cost of G small. 



5.3 Routing Graph Optimization 

Let us denote by p v the position of node v of G. Before the optimization step we have G in a valid state. We 
find hub radii (perhaps very small) keeping G valid, but we might have not enough space for the hubs and 
the bundles. In particular, we might have hubs for which the desired radius is larger than the allowed radius. 
The optimization starts by trying to move each such hub away from obstacles into another valid position, 
where the allowed radius is closer to the desired radius than before. This is done in a loop, for which we 
initially set r to the desired radius of node v. A move is attempted along vector £ with the direction u Sl 

where u s = " p "> , and the sum is taken over all obstacles s intersecting the circle of radius r with p v as the 

\O.Pv\ 

center. The length of £ is Or for some fixed 6 > 1, which is 1.1 in our implementation. If the new position 
of v is invalid, we diminish r and repeat the procedure. We stop iterations for a node after 10 unsuccessful 
moves. 

After this step we obtain more space for the paths, but routing cost is typically increased. Therefore we 
again iteratively adjust thejpositions of intermediate nodes to diminish the cost. In one iteration we consider 
an intermediate node u of G and try to find a valid position p u which maximally decreases routing cost. We 
assume that our optimizations do not change the capacity segments crossed by each path; hence, we only 
try to find the local minimum of ink and path lengths components of the cost. Consider the function which 
shows the contribution of node u to routing cost: f(p u ) = k ink J2 v \PuPv\ + henY^ Wst (\PuPv\ + \PuPw\)/\st\, 
where the first sum is taken over all neighbors of u, and the second sum is taken over all paths passing 
through nodes v, u, w. To minimize f(p u ), we use gradient descent method by moving u along the direction 

D = k ink J2 v jf^y + kienYs-Kst ]h\ (|§S^T + l§S^) using a sma11 fixed step size ' If routin 9 cost is reduced, 
and the new position is still valid, we repeat the procedure, otherwise the move is discarded and we proceed 
with the next node. We pass a predefined number of times over all nodes of the graph; this number is 2 in 
our implementation. 

After this optimization procedure we also try to modify the structure of graph G, if it is beneficial. We try 
the following modifications: (1) shortcut an intermediate node of degree two; (2) glue adjacent intermediate 
nodes together. These transformations are applied only if they reduce routing cost while keeping the graph 
valid. 
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What is the complexity of the above procedure? Let c be the time required to find out if a circle, or 
a straight line segment, intersects an obstacle. Using an R-tree [20] on the obstacles, one can find out if a 
circle or a rectangle intersects the obstacles in 0(c log \W\) time. The number of edges in G is \U\, therefore, 
an iteration optimizing the position of every node of G can be done in 0(c|E/| log |W|) time. Since we apply 
a constant number of iterations, this is also a bound for the whole procedure. 

6 Ordering Paths 

At this point the routing is completed and the bundles have been defined. We draw the paths of a given 
bundle parallel to the corresponding edge, therefore two paths may need to cross at a node as shown in 
Fig. [8] The order of paths in bundles affects^ crossings of paths. Some path crossings cannot be avoided since 
they are induced by the edge crossings of G, while others depend on the relative order of paths along their 
common edges, and thus might be avoided. We would like to find the orders such that only unavoidable 
crossings remain. Let P be the set of simple paths in G computed by path routing. We address the following 
problem. 




(a) (b) 



Figure 8: Graph G with 4 terminal (squares) and 7 intermediate (circles) nodes, (a) Ordering with 6 
crossings, (b) Optimal ordering with 3 crossings. 

Problem 4 Given the embedded graph G and a set of simple paths P , find an ordering of paths for each 
edge of G that minimizes the total number of crossings among all pairs of paths. 

The complexity class of Problem [4] is unknown. However, in our setting paths satisfy an additional 
property that makes possible to solve the ordering problem efficiently. We now describe this property. 

For path (w 1; . . . , Vk) £ P the nodes v\ and Vk are called terminal, and the nodes v 2 , ■ ■ ■ , Vk-i are called 
intermediate. In our setting, the paths terminate at the nodes of G corresponding to the centers of the 
obstacles. These are the only nodes that can be terminal for paths. The rest of the nodes are intermediate. 
Thus we have: 

Path Terminal Property: No node of G is both a terminal of some path and an intermediate of some 
path. 

We thus consider the following variant of the crossing minimization problem. 

Problem 5 (Path Ordering) Given an embedded graph G and a set of simple paths P satisfying the path 
terminal property, compute an ordering of paths for all edges of G so that the number of crossings between 
pairs of paths is minimized. 

We say that two paths having a common subpath have an unavoidable crossing if they cross for every 
ordering of paths. An ordering of paths is consistent if the only pairs of paths with unavoidable crossings 
cross (and these only once). Clearly, a consistent ordering has the minimum possible number of crossings. We 
will prove that a consistent ordering always exists, and we provide two algorithms to construct one, solving 
Problem[5j The first algorithm is a modification of a method for wire routing in VLSI design proposed in |19j . 
We call it the "simple algorithm" since it utilizes a simple idea, and it is straightforward to implement. The 
second is a new algorithm that has a better computational complexity. 
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6.1 Simple Algorithm 



To find an ordering of paths we iterate over the edges of G in any order, and build the order for each edge. 
Let P uv be a set of paths passing through the edge uv. We fix a direction of the edge, and construct the 
ordering on it by sorting P uv . To define the order between two paths tti , ~ki £ P uv we walk along their 
common edges starting from u until the paths end or the following cases happen ; If we find an edge which 
was previously processed by the algorithm, we reuse the order of 7Ti and 7T2 generated for this edge. If wc 
find a fork node /, by which we mean the end of the common sub-path of the two paths, then the order 
between tti and W2 is determined according to the positions of the next nodes of the paths following /, the 
path turning to the left is smaller. Otherwise, if such a fork node is not found (which means that the paths 
end at the same node), we walk along the common sub-path in the reverse direction starting from node v, 
and again looking for a previously processed edge or a fork. The above procedure determines a consistent 
order for all pairs except of the pairs of coincident paths; such paths are ordered arbitrarily on the edge. 

This ordering creates unavoidable crossings only. As mentioned earlier, the above algorithm is a minor 
variant of one in (Tijj . The proof of correctness follows the same lines as in [TH] . Here we discuss the main 
differences between the two variants. First, for the sake of simplicity we do not modify the graph and 
the paths during computations. Second, in our setting the nodes of G might have an arbitrary degree; in 
particular, several paths might share an endpoint, and thus the degree of the terminals is not bounded. That 
is why to compare two paths we traverse their common subpath in both directions. The running time of 
our algorithm is also different from that in [T5]. Our algorithm iterates over the edges of G — (W, U), which 
takes 0(|[/|) time. For each edge it sorts paths passing through it. A comparison of two paths requires at 
most 0(| W|) steps (maximal length of a common subpath). Let \P e \ be a number of paths passing through 
edge e e U. Then the sorting on the edge takes 0(|W||P e | log |P e |) time. Since J2 e eu l-^el = ^' wnere L IS 
the total length of paths in P, and \P e \ < \P\, the algorithm runs in 0(|IF|Llog \P\ + \U\) time. However, 
this is a pessimistic bound. Normally, the length of a common subpath of two paths is much smaller than 
\W\, and the number of paths passing an edge is smaller than \P\, that makes the algorithm quite fast in 
practice. 

The above algorithm is straightforward to implement. However, the following slightly more complicated 
algorithm has a better asymptotic computational complexity. 

6.2 Linear-Time Algorithm 




Figure 9: Removal of node v; intervening steps; re-insertion of v. Paths of P v are numbered 1, . . . , 7, and 
are shown in braces when unordered, and then with placement indicating their orders. The removal of v 
defines the order between paths 4 and 6 on edge ve. The order between paths 2 and 3 was calculated in the 
intervening steps. 

The algorithm consists of two phases. Each step of Phase 1 involves the deletion of a node of G (Fig. [9J). 
For each non-terminal node v in turn we do the following. Enumerate the edges incident to v as e\, ea, . . . , et 
in clockwise order, and let v±, . . . , vt be the corresponding nodes adjacent to v. Represent each path ir e P v 
using edges e a and e& by an unordered pair {a,b}. For each pair {a, b} add a new edge v a Vb- Modify the 
path represented by {a, b} so that it goes directly from v a to Vb using this new edge. The new edges incident 
to v a should be inserted into v' a s clockwise order in the position previously occupied by e a , in the order 
determined by the indexes of e&. Finally, delete node v from the graph. 
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In Phase 2 the process is reversed. Initially, all non-terminal nodes have been deleted, and we re-insert 
them in the reverse order to which they were deleted, adding orders of paths to the edges in the process. 
Consider the re- insertion of node v. The new order of paths along edge e a is obtained by concatenating the 
orders of paths along the edges v a v b . 

We now give some details of the implementation. First, create a new graph H . Initially, H is the subgraph 
of G induced the union of all the paths of P. Each path in P is stored as a list of nodes in H. For an edge e, 
let P e denote the set of paths containing e. We assume that for every node v of H , the list of edges incident 
to v in clockwise order is given. Note that these lists are dynamic since H undergoes deletions of nodes. We 
keep track of the deletions in a forest F. Initially, F consists of isolated nodes corresponding to the edges 
of H. When a node is deleted and an edge e is replaced by new edges, we add them in F as children of the 
node corresponding to e. For instance, Pdv contains paths 1,4,5, and 7 in Fig. [9j When node v is processed, 
set Pdv is split into new subsets Pd e , Pd a , and Pdb- The (clockwise) order of subsets is important. Then 
we replace edge dv by edges de, da, and db in graph H, in paths Pdv, and in order of edges around d. We 
stress here that graph H might contain multiple edges after this operation, since there could be another path 
passing from d to any of e, a, or b. It is important to keep multiple edges and corresponding sets of paths 
separately. The first phase finishes when all non-terminals are deleted from H . 

In the second phase, we process each tree in F in bottom-up order from children to parents. The list of 
paths for a node of F is simply the concatenation of the lists of its children. The leaves of F correspond to 
the original paths of P. 

Theorem 1 Given the graph G = {W, U), a set of simple paths P satisfying the path terminal property, and 
a clockwise order of the edges around each node, the above algorithm computes a consistent ordering of paths 
in 0(| W | + \ U\ + L) time, where L is the total length of paths in P. 

Proof: Correctness. The ordering of paths computed by our algorithm is consistent since the deletion of node 
v adds only unavoidable crossings. Two paths tt% and ~ki will produce a crossing only when the last node of 
their common subpath is deleted and the clockwise order of the nodes around v is • • • v a • • • Vb • • • v a ' • • • vy ■ ■ ■ , 
where ir-y = ■ ■ ■ v a vv a > ■ ■ ■ and ir 2 — ■ ■ ■ v^Wy 

Running time. The time for processing node v (the deletion of v) is 0(1 + d v n + s v ), where d v> H is the 
degree of v in H at the current step and s v is the number of paths passing through v. The claimed bound 
follows since d Vt u < d v q + s v . ■ 

We note that the size of the input for Problem [5] is fi(|W| + \U\ + L), which is the same as the running 
time of our algorithm if the clockwise order of edges around nodes is part of the input. If this order is 
not given in advance, the time complexity of the algorithm is 0(|W| log \W\ + \U\ + L). Since the length 
of a path does not exceed \W\, we have L = \W\\P\. Therefore, another estimate for the running time is 
0(|W|log|W| + |tf| + |W||P|). 

6.3 Distribution of crossings 

Consistent orderings are not necessarily unique. We noticed that the choice of a particular one may greatly 
influence the quality of the final drawing. The following property might appear desirable. An ordering of 
paths is nice if it is consistent, and for any two paths, their order along all their common edges is the same 
(i.e. they may cross only at an endpoint of their common sub-path). We next analyze the existence of nice 
orders. 

Proposition 1 If G is a tree, then there is a nice order of paths P. 

Proof : To build a nice order of paths on a tree, we use the algorithm below. Initially, choose any terminal 
node to be the root, and then consider the edges starting at the leaves towards the root. The first time we 
encounter a common edge of two paths will be at the end of their common sub-path. Thus, the crossing 
between them (if needed) will take place on that edge. To ensure that the crossing goes to the endpoint of 
the sub-path, we direct the considered edge towards the root of the tree. ■ 
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Table 1: Statistics on test graphs and performance of the algorithm steps (in seconds). 



We found an example of (G, P) having no nice ordering. 
Proposition 2 There exists (G, P) that admits no nice ordering. 

Proof: Consider a gadget with four paths shown in Fig. [Toj It consists of a black path, two blue paths 
(called left and right blue path respectively) and a central red path. In any nice ordering the first (left) blue 
path must be above the black path or the second (right) blue path must be below the black path. Otherwise, 
if the left blue path is drawn below and the right blue path is drawn above the black path, then red and 
black paths do not intersect nicely (that is, their order along common edges is not the same). 



Figure 10: The gadget and 3 ways to draw it. 



Now consider the graph G with 7 gadgets in Fig. 11(a) where only black paths are shown. The gadgets 



do not share any paths except (a) gadgets 1 and 4 share the blue path, and (b) gadgets 1 and 7 share the 
blue path. In a nice ordering either the left blue path for path 1 is above it or the second one is below it. 



In the first case, the blue paths of gadget 4 do not satisfy the above property, see Fig. 11(b) (a). This is a 
contradiction. In the second case, the blue paths of gadget 7 do not satisfy the above property, as shown in 



Fig. 11(b) (b). Again, this is a contradiction, which proves that there is no nice ordering of considered paths 
on G. ■ 



7 Results 

We implemented our algorithm in the MSAGL package |25j . Edge bundling was applied for synthetic graph 
collections and several real- world graphs (see [55] for a detailed description of our dataset). Unless node 
coordinates are available, we used the tool to position the nodes. All our experiments were run on a 3.1 GHz 
quad-core machine with 4 GB of RAM. Table [I] gives measurements of the method on some test cases. 

Performance The table shows the CPU times of the main algorithm steps. As can be seen, ordered 
bundles can be constructed for graphs with several thousand of nodes and edges in less than a minute. For 
medium sized graphs with hundreds of nodes the algorithm can be applied in an interactive environment. 
The most time expensive step is Routing Graph Optimization. The explanation for its long running time is 
twofold. First, it continues to modify G as long as it reduces routing cost, and we do not have a good upper 
bound for the number of iterations. Second, the step makes a lot of geometric operations such as checking an 
intersection between circles, polygons and line segments. These operations are theoretically constant time 
but computationally expensive in practice. We conclude that the optimization of this step is the primary 
direction for the running time improvement of our algorithm. 
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(a) 



(b) 



Figure 11: (a) Graph G consisting of 7 gadgets. Each black path starts at a leaf marked by a number and 
ends at a leaf with the same number, (b) Two possible cases; the blue dots mark the location of the gadget's 
blue paths. 




(a) 




Figure 12: Migration graph, (a) Overview, (b) Detail. 



Selection of parameters We describe the influence of the parameters in routing cost on the final drawing. 
The weight of the ink component ki n k can be considered as the "bundling strength" , since larger values of 
kink encourage more bundling. The path length component is complimentary and has the opposite effect. 
The capacity component plays an important role for graphs with large and closely located node labels (see 
for example the narrow channel between nodes 52 and S5 in Fig. |14(b)| . Based on our experiments, we 
recommend selecting k\ en significantly larger than kmk, otherwise some paths can be too long. The capacity 
weight k cap is set larger by order of magnitude than the other coefficients. In our default settings ki n k = 1, 



Wen 



500, and k 



cap 



10(fcinfc + kl en ). 



We may also adjust the widths of individual paths and path separation (see Fig. 14). In the extreme case 



when the path separation and path widths are set to zero, we obtain a drawing with overlapped edges. 



Examples We now demonstrate ordered edge bundling algorithm on some real-world examples. A mi- 
gration graph used for comparison of edge bundling algorithms is shown in Fig. 12 In our opinion, on 



a global scale ordered edge bundles are aesthetically as pleasant as other drawings of the graph (see e.g. 
[7| I16 [ 122] , I23j). On a local scale, our method outperforms previous approaches by arranging edge intersec- 
tions. A smaller example of edge bundling is given in Fig. |13| It shows another advantage of our routing 
scheme. Multiple edges are visualized separately making them easier to follow (compare the edge between 
nodes Editor and Application on the original and bundled drawings). 



Limitations The main limitation of our method that we are aware of is in choosing radii and positions 
for hubs. Sometime we do not utilize the available space in the best way. We plan next to improve this 
heuristic. 
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|Command| I Application | 
(b) 



Figure 13: Tail graph, (a) Standard routing computed with the visibility graph approach, (b) Edge bundling 
routing. 



8 Conclusions and Future Work 

We have presented a new edge routing algorithm based on ordered bundles that improves the quality of 
single edge routes when compared to existing methods. Our technique differs from classical edge bundling, 
in that the edges are not allowed to actually overlap, but are run in parallel channels. The algorithm ensures 
that the nodes do not overlap with the bundles and that the resulting edge paths are relatively short. The 
resulting layout highlights the edge routing patterns and shows significant clutter reduction. 

An important contribution of the paper is an efficient algorithm that finds an order of edges inside of 
bundles with minimal number of crossings. As mentioned above, this order is not unique. The question of 
choosing the best order is a topic for future research. 

A further possible direction concerns dynamic issues of edge bundling algorithm. First, a user may 
want to interactively change a bundled graph or change node positions. In that case, a system should not 
completely rebuild a drawing, but recalculate affected parts only. Second, a small deviation of algorithm 
parameters (e.g. ink importance ki n k) may theoretically involve a full reconstruction of the routing, while a 
smooth transformation is preferable. 
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